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Abstract 


We study the following problem, introduced by Chung et al. in 2006. We are given, online or offline, 
a set of coloured items of different sizes, and wish to pack them into bins of equal size so that we use 
few bins in total (at most a times optimal), and that the items of each colour span few bins (at most fi 
times optimal). We call such allocations (a, /3)-approximate. As usual in bin packing problems, we allow 
additive constants and consider {a, fi) as the asymptotic performance ratios. We prove that for e > 0, if 
we desire small a, no scheme can beat (1 + e, n(l/e))-approximate allocations and similarly as we desire 
small /3, no scheme can beat (1.69103,1 + ^(-approximate allocations. We give offline schemes that come 
very close to achieving these lower bounds. For the online case, we prove that no scheme can even achieve 
(0(1), 0(l))-approximate allocations. However, a small restriction on item sizes permits a simple online 
scheme that computes (2 -|- e, 1.7)-approximate allocations. 

1 Introduction 

We consider the problem of computing locality-preserving allocations of coloured items to bins, so as to 
preserve locality (colours span few bins) but remain efficient (use a few total bins). The problem appears 
to be a fundamental problem arising in allocating files in peer-to-peer networks, allocating related jobs to 
processors, allocating related items in a distributed cache, and so on. The aim is to keep the communication 
overhead between items of the same colour small. One application for example appears in allocating jobs in a 
grid computing system. Some of the jobs are related in a such a way that results computed by one job is used 
by another one. There are also non-related jobs that may be from different users and contexts. Related jobs 
are of a same colour and each job has a length (number of instructions for example). In the grid environment 
each computer has a number of instructions donated by its owner to be used by the grid jobs. This way the 
objective is to allocate jobs to machines trying to use few machines (bins) respecting the number of instructions 
available (bins size), while also trying to keep related jobs together in as few machines as possible. In peer-to- 
peer systems a similar problem also appears where one want to split pieces of files across several machines, 
and want to keep pieces of a file close together to minimize the time to retrieve the entire file. 

These problems can be stated as a fundamental bi-criteria bin packing problem. Let / be a set of items, 
each item of some colour c € C, and denote by L the set of items of a given colour c. Denote by OPT(I) 
the minimum number of bins necessary to pack all items and denote by OPT(/c) the minimum number of 
bins necessary to pack only items of colour c, i.e, as if we had a bin packing instance with items Ic- Let A{I) 
be the number of bins generated by algorithm A when packing all items, and for each colour c, let A{Ic) be 
the number of bins of this packing having items of colour c. We say that items of colour c span A{Ic) bins 
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in this packing. We want an algorithm that minimizes both ratios and maxcgc ) ■ So we would 

like to allocate the items to bins so that we use few bins in total (at most qOPT(/), where we call a the bin 
stretch), and the items of each colour c span few bins (at most /30PT(Ic), where we call /3 the colour stretch). 
We call such allocations (or packings) {a, /?)-approximate. The problem of minimizing any one of a or /3 is 
equivalent to the classical one-dimensional bin packing, but as we show, in general it is not even possible to 
minimize them simultaneously. A natural extension is to consider bins as nodes of some graph G, and we 
want to allocate bins so that each subgraph Gc induced by nodes containing items of colour c has some natural 
property allowing small communication overhead, such as having low diameter, or small size. 

We prove that for e > 0, if we desire small bin stretch, no scheme can beat (1 -|- e, n(l/e))-approximate 
allocations and similarly as we desire small colour stretch, no scheme can beat (1.69103,1 -|- e)-approximate 
allocations. We give offline schemes that are based in well know bin packing algorithms and yet come very 
close to achieving these lower bounds. We show how to construct (1 + e, fl(l/e)) and (1.7,1 -|- e) approx¬ 
imate allocations, the first one closing the gap with the lower bound and the last one almost closing the gap. 
For the online case, we prove that no scheme can even achieve (0(1), 0(l))-approximate allocations. How¬ 
ever, a small restriction on item sizes permits a simple online scheme that computes (2 -|- e, 1.7)-approximate 
allocations. 

2 Preliminaries 

We now formulate the problem of computing locality-preserving allocations as a coloured bin packing prob¬ 
lem. We are given a set / of n coloured items each item e with a size s(e) in (0,1] and with a colour c(e) 
from C = {1,..., m}, and an infinite number of unit-capacity bins. Let Ic be the set of colour-c items, and 
denote by OPT(/) (OPT(/c) respectively) the smallest possible number of bins needed to store items in I (Ic 
respectively). For a packing P of items I, define P{I) as the number of bins used to pack I, and define Pc{I) 
as the number of bins spanned by colour-c items in the packing P. When I is obvious, we drop it and write P 
and Pc- 

We define an {a, P)-approximate packing as one where: (1) P ^ aOPT(/) -|-0(1) and (2) for each colour 
c £ G, Pc ^ /30PT(/c) -|- 0(1). An algorithm that always produces (a,/3)-approximate packings is called 
an {a, P)-approxiniation algorithm. 

As usual in bin packing problems, we allow additive constants and consider a (respectively /3) as the 
asymptotic performance ratio as OPT(I) (respectively OPT(/c)) grows to infinity (and hence the total weight 
of items). This is because a simple reduction from PARTITION (eg see ifTOl l shows that, without allowing 
additive constants, it would be NP-hard to do better than (1.5 — e, (5) or (<5,1.5 — e) approximate packings for 
any 5. 

When dealing with the online problem we have similar definitions for the competitive ratio of an online 
algorithm, and in this case OPT(/) corresponds to an optimal offline solution to instance I that has full 
knowledge of the request sequence I. As standard, we shall use the term approximation ratio interchangeably 
with competitive ratio when discussing online algorithms (ie a 2-approximate online scheme is one that is 
within a factor 2 of the optimal offline scheme). 

2.1 Related work 

Chung et al.||4l consider the case where each item is of a different colour and can be fractionally (arbitrarily) 
divided between bins, bins have different sizes and the total weight of items exactly equals the total weight 
of bins. They show how to compute an allocation that is asymptotically optimal for each colour. By contrast, 
we relax the assumption that we must exactly fill all the bins, and consider the case of indivisible allocations. 
In this setting, the problem is much more interesting: it is impossible to get arbitrarily good (1 + e, 1 -|- e)- 
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approximate allocations in general. Thus, these relaxed packings have a tradeoff between bin stretch and 
colour stretch, with polynomial-time approximations. We also consider for the first time the case where items 
arrive online. However, the case of heterogenous bins is open for our setting. 

The nonexpansive hashing scheme of Linial and Sasson ifT^ can also be used to find a localify-preserving 
packing for unif-size items. By defining fhe disfance of fwo ifems fo be 0 if fhey are of fhe same colour, and 
5 > 1 ofherwise, one can interpref fheir dynamic hashing resulf as follows: for any e > 0, if is possible fo hash 
unif-size ifems info bins in 0(1) time so fhaf fhey have use 0(0PT^^^) bins (giving bin sfrefch O(OPT^) 
and colour sfrefch 0(1). 

Krumke ef al. lfldl sfudy a relafed ‘online coloured bin packing’ problem where fhe goal is fo minimize 
fhe number of differenl colours packed info each bin, while using fhe entire capacify of each bin (in fheir 
problem all items have same unit size). However, this problem is quite different to ours. In particular, an 
optimal solution problem when minimizing the number of colours per bin may give arbitrarily bad bin stretch. 
Consider b bins of capacity x, and unit size items of many colours ci, C 2 ,..., C{^^- 2 )h+i- There will be 2b items 
of colour Cl and 1 item of each of the other colours. Now, a (1,1)-approximate packing places x colours 
from {c 2 ...C( 3 ,_ 2 )fe+i} into each bin and the items of ci into the remaining bins. On the other hand, a packing 
minimizing the maximum number of colours per bin (while using all the capacity of each bin) will place 2 
items of ci and x — 2 items of other other colours into each bin. Hence, considering colour 1, it may be packed 
individually into OPT(/i) = 2b jx bins, but in this solution it spans b bins, giving colour stretch x/2, which 
can be made arbitrarily large. 

There are some other variants of bin packing problems with colours, for example the so called Colored Bin 
Packing that has the restriction that items of a same colour cannot be packed next to each other on a same bin. 
Approximation algorithms for the online version of the problem were presented by Bdhm et al lH and Dosa 
and Epstein lH. A generalization where the objective is to pack a graph G into another graph H where nodes 
into H have capacities and nodes in G corresponds to items of given size, was studied by Bujtas et al. in Q 

The ‘class-constrained bin packing problem’, studied by Golubchik et al. lfTTI . Kashyap et al. lfTSl . Xavier 
et al. lfTSl and Epstein et al.||71 is a coloured bin packing problem. The aim is to minimize the number of bins 
used, subject to the constraint that each bin contains items from at most c different colours (and subject to its 
capacity constraints). This problem has applications in developing algorithms for data placement on parallel 
disk arrays. Again, optimal solutions to this problem may be arbitrarily far from good if we wish to minimize 
colour stretch. 

A recent survey covering different variants of bin packing problems was made by Coffman et al. 0. 


3 Impossibility results for Offline Algorithms 

We start by considering some lower bounds on what values of bin and colour stretch can be achieved simul¬ 
taneously. All these bounds hold for offline algorithms, so there is some inherent tension between the two 
measures of colour stretch and bin stretch. We now show a lower bound on colour stretch, if we wish to take 
bin stretch arbitrarily small. 

Theorem 1 For bin stretch (1 -|- e), it is impossible to achieve better than Lt{l/e) colour stretch. 

Proof. Eor 0 < 6 < 1/2, consider the instance containing n items of size 1 — 5, one for each colour 
1, ..., n and n items of size 5, all of colour n + 1. Eor simplicity assume that e = l/x for some integer x, 
and 5 = 2e, such that 5n and en are integers. We have OPT(/) = n, OPT(/c) = 1 for c = 1,..., n and 
OPT(/„_|_i) = 5n. Assume we want to construct a packing using at most (1 -|- e)n bins in total. 

Since items of colours c = 1,..., n can not fit together, we use at most ne bins only for colour n + 1, then 
at least n — {ne/5) = n/2 colour-(re -|- 1) items overflow. Therefore at least re/2 + en = re(l/2 -|- e) bins 
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are used for colour-(n + 1) items in any packing using at most n(l + e) bins. Since OPT(/„+i) = 6n, the 
number of bins for colour- (n -I- 1) is at least 

OPT(4+i)(l/2 + e)/<5 = OPT(4+i)(l/(4e) + l/2) 

= f2(l/e)OPT(4+i). 

The construction holds for 5 < 1/2, so it is valid for e < 1/4. □ 

On the other hand, if one wants to keep low colour stretch, no bin stretch smaller than 1.69103 can be 
achieved. 

Theorem 2 For colour stretch (1+e), it is impossible to achieve bin stretch better than 1.Q9103, for sufficiently 
small e. 


Proof. Consider the following Sylvester sequence with Iq = 1, Ijj^i = lj{lj + 1). For some constant m, we 
assume we have items of (m + 1) different colours where, for colour Cj, i = 0,..., m, we have a list of n 
items each one with size + e, where e is a small enough constant that depends on the value of m. Note 
however that m is independent of e, and its value will be defined later. For each colour c, an optimal packing 
OPT(/ci) for colour stretch uses ^ bins for z = 0,..., m, each bin containing exactly k items. We assume 
for simplicity and w.l.o.g that Zj divides n. Let P be the packing corresponding to the union of the optimal 
packings OPT(/ci) for each colour. Notice that the bins of this packing cannot be joined together. 

Now consider an optimal packing P* for bin stretch, but which has colour stretch at most (1 + e). We will 
show that if the packing has colour stretch (1+e), then most of the bins for each colour Cj are packed like the 
optimal colour stretch packing. So P* uses almost the same number of bins as P. 

For some colour Cj, let kj" be the number of bins in P* that contain exactly j items for j = 1,..., k- 
We want to upper bound the number of bins that contain less than Zj items, which is kjk Since P* has 

colour stretch (1 + e), each colour q must span at most (1 + e)n(li bins. The number of bins used to pack 
items of colour q can be bounded as follows: 




1 b 1 


1=1 


1=1 


so we can write 


/j 1 /j 1 Zj 1 

E E = E[(^* - ^ 

1=1 1=1 1=1 


en 


Since (Zj — j) ^ 1 for j = 1,..., Zj — 1, the number of bins in P* not containing exactly Zj items of colour Cj is 
at most ^ Hence there are at least n — {k — l)en items that must be packed in bins containing 

Zj items. So at least 

= ^(1 - ft - i)e) 

n n 

bins are used to pack only items of colour Cj. Note that m is a constant and then (Z^ — 1) is 0(1). The packing 
P* must use at least 

m 




i=0 
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bins, while an optimal solution for bin stretch uses exactly n bins by packing one item of each colour in a bin, 
so the bound 

P*{I) > V 1 - q- l)g 

OPT(I) ^ ^ k 

m — 

holds for m ^ 5 and sufficiently small ^ ^ m- ° 

Somewhat suprisingly, the two correct bounds are not symmetric - the upper bounds in the next section 
show that we can indeed achieve (0(1), 1 + e)-approximation schemes. 

4 Offline Algorithms 

4.1 A (1 + £, 0(l/£))-approximation algorithm 

We now describe how to achieve asymptotically the bound in Theorem [U We shall make use of the APTAS of 
Fernadez de La Vega and Lueker HI (VL), which operates as follows: 

The APTAS VL: fix some e > 0, and separate items I into small Is (< e) and large e). For the large 

items, sort them by increasing size and partition them into K = 1/e^ groups, each of at most ne^ items. 

Round each item up to the size of the largest item in its group, to obtain an instance J. 

Each bin contains at most 1/e items from //, so the total number of different bin types is at most t = 
(^^1, and the total number of possible packings using at most n bins is at most (”^*), which is polynomial 
in n. Therefore we can enumerate these packings and choose the best one. Since we have rounded all items up 
in size, a packing of the rounded up items gives a valid packing of the original items. The following elegant 
domination argument (from iHl) shows that an optimal packing for the rounded up items uses at most a factor 
(1 + e) more bins than packing the original items: consider rounding down item sizes to the smallest in the 
group to obtain an instance J'. Then a packing for J' gives a packing for all but the largest group in J, which 
contains at most ne^ items. Since each item has size A e, we have OPT(A) ^ ne. Thus, 

OPT(J) A OPT(J') + ne^ A (1 + e)OPT(/;). 

Now take the small items Ig and pack them into the remaining free space using first fit (FF). If we do not 
open more bins, then we already have at most (1 + e)OPT(/) bins. If we need to add more bins, then clearly 
each bin except at most 1 is full to at least 1 — e. In this case, we have at most OPT(/)/(l — e) + 1 A 
(1 + 2e)OPT(/) + 1 bins. 

Our modification: For our problem, we can use the rounding step, but we cannot use the FF step for small 
items (as some colours may be spread over many bins). However, a small change fixes fhis: group small items 
by colour, pack each group using FF info existing bins having more fhan 2e of free space, fhen open more bins 
if necessary. Wifh fhis idea, for each colour c, every bin (excepf af mosf 1) eifher confains af least e weight of 
colour c, or no items of colour c (if a bin contains a large item this is clearly true, and if not, since we used FF 
and each bin has at least 2e of free space, at least half of this space is used). 

So each colour spans at most OPT(/c)/e bins, giving the desired colour stretch. For the bin stretch, the 
argument is similar to the one above - if new empty bins are used when packing small items, then each bin is 
full to at least (1 — 2e) and if not, we already have the desired number of bins. 


lib ^ 

1.69103 -e' 

i=o * 
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4.2 A (1.7,1 + e)-approximation algorithm 

We now present an algorithm that almost closes the gap with the lower bound of Theorem |2] For this, we will 
use both the APTAS of Fernadez de La Vega and Lueker (VL) |l8]| described above, and the online bin packing 
algorithm Bounded Best-Fit (BBF), whose competitive ratio is 1.7 ll5l. 

BBF : maintain at most k open bins, and the rest are closed and cannot be reopened. An item of size s is 
packed into the open bin that is most full and has space for the item, breaking ties arbitrarily. If no such bin 
exists, the fullest bin is closed and a new empty bin opened. It is known that BBF with k = 2 has (asymptotic) 
competitive ratio 1.7 0. 

Our algorithm is presented in Algorithm[T] It first packs items of each colour separated using the algorithm 
VL. Then given all m packings for each colour in some order, we apply the algorithm BBF over the items in 
the order the items appears in these packings. 

Algorithm 1 A (1.7,l-i-e)-APPROXiMATiON algorithm 
1: Arbitrarily order colours ci... Cm. 

2: for each colour Ci do 

3: pack items of this colour into new bins using the APTAS of (VL) (all bins are monochromatic). 

4: end for 

5: Let P = P{ci) U ... U P{cm) be all bins generated. 

6: Let P' be a new packing initially empty. 

7: for each item e in the order it appears in P do 
8: Pack e into P' with BBF. 

9: end for 

10: Return P'. 


We now prove a lemma that shall be useful in proving the desired approximation ratio of the algorithm. 

Lemma 3 Let b = (i?_(fc_i),..., Bq), be some opened bins that may contain items, and let P = ,..., B^) 

be bins packing items of some set S. Let P' be the packing generated over the items in S by the BBF algorithm 
in the order they appear in P using the bins in b as initially opened. Then the number of used bins by P' is at 
most k + X. 

Proof. Let P' = ..., B'q, B[, ..., By) be the bins in the order they are closed by BBF. We will 

show that any item e € of P for z G {1,..., x} is packed in a bin P' of P' where j ^ i. 

Assume for contradiction that e € Pj is the first item packed in some bin P' with j > i. Since e is the first 
such item of Bi, all previous items e' G P*', z' = 1,..., z — 1 must have been packed in a bin P', with j' ^ i'. 
So bin P' only contains items of Bi. But since P is a valid packing, there must be room for e in P'. □ 

Theorem 4 The algorithm computes (1.7,1 + ej-approximate packings. 

Proof. The time bound follows since the number of colours is polynomial in n, and both algorithms VL and 
BBF run in polynomial time. In steps (1-4) we generate packings P{ci) for each colour Cj such that P{ci) ^ 
(1 -|- 2e)OPT(cj) -b 1. In steps (7-9) of the algorithm it is used the BBF algorithm to pack the items in the 
order they appear in P = (P(ci),..., P{cm))- Since the BBF algorithm keeps at any time k = 2 opened bins, 
by the previous Lemma[3l in the final packing P' we have for each colour q, Pf (I) ^ (1 + 2e)OPT(/ci) + 3. 
Since BBF has approximation factor 1.7 we also have the bound PfI) ^ 1.70PT(/) -|- 0(1) for the entire 
packing. □ 

It is interesting to note that just packing the items of each colour in order using bounded best fit gives a 
(1.7,1.7)-approximate packing (for the bin stretch, ignore colours then the entire packing is L7-approximate, 
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and for colour stretch use Lemma [3] above to get Pc ^ 1.70PT(/c) + 0(1) since the algorithm is bounded 
space). 

Note that in order to have the (1.7,1 + e)-approximation we need a bounded space online algorithm on 
step 8 of Algorithm [TJ but not necessarily any online bounded space algorithm would work. We have to use 
an algorithm that satisfies the property of Lemma[3] Consider the Harmonic algorithm |[T5l for instance. If we 
have an instance consisting of just one colour, then after step 8, the Harmonic algorithm would have separated 
items by types of sizes creating an entire new packing and the (1 + e)-colour stretch would be lost. 

5 Online Algorithms 

We now consider the online version of the problem. Coloured items arrive and must be packed with no 
knowledge of future arrivals. The main difficulty with constructing an online algorithm is that we don’t know 
in advance the total weight of each colour, but on the other hand would like to reserve space so that colours of 
small weight aren’t spread over many bins. 

5.1 Impossibility of online (0(1), 0(l))-approximation 

Even under the restriction that items are ^ e, there is still a lower bound L ^ 1.5403 for the online classical 
bin packing problem, due to by Balogh et al. HI which improved a previous lower bound of 1.5401 by van 
Vliet IITtI . In this case, L is also a lower bound for both bin stretch and colour stretch; we cannot hope to do 
better in either parameter. To see this, consider packing items of only one colour, then the number of bins used 
cannot be smaller than LOPT -|- 0(1). 

We now show that no online (0(1), 0(l))-approximation scheme exists. The idea is to consider items in 
rounds. In each round an optimal packing for the items needs a single extra bin. If a scheme has bin stretch 
0(1) it only needs 0(1) bins per round, but if the instance has a large number of colours then some fraction of 
colours will be forced to split every few rounds, from which it follows that some colour must split in at least a 
constant fraction of rounds. 

Theorem 5 There is no (a, fi)-approximation online algorithm for the coloured bin packing problem where 
(a, /3) are constants. 

Proof. Consider an instance where there are n different colours and where each item has size 1/n. We analyze 
the packing in rounds and in each round we receive a list L of re items of re different colours. There will be at 
most nx rounds where x is the number of bins necessary to pack any colour after rex rounds, and re is going to 
be defined later. 

Let li be the total number of items until round i (there are ni) . Since the optimal packing for bin stretch 
uses i bins, any {a, /?)-algorithm must use at most ai bins by round i. Wlog assume that in each new round 
the algorithm uses at most a bins, since otherwise it will have approximation ratio > a and the adversary will 
stop at this point. The algorithm may use less than a bins in one round and more than a bins in a later round, 
but the average per round must be a. So we can assume that the algorithm opens a bins per round even if 
it will only use some bins in a later round. So we focus on the colour stretch. We will consider at most rex 
rounds, and so there are at most rex items of each colour. Clearly the optimal packing for colour stretch has x 
bins per colour, so A must guarantee that each colour spans at most x/3 bins. 

In each round the algorithm has a bins available and some other bins that were partially filled. Since fhe 
bin sfrefch is guaranteed to be at most a, the only job of the algorithm is to keep the /3 approximation in colour 
stretch. We now show that any algorithm A must incur colour stretch larger than /3 on the request sequence. 
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Define s(i) = a + a^ +... + a*, with s(0) = 0. For i ^ 1, stage i consists of rounds (s(t — 1) +1)... s{i). 
At the end of stage i the algorithm has as{i) bins available, and items of total weight s{i). It can be seen that 

s{i + d) = a'^s{i) + s{d) > a'^s{i). (1) 

We consider groups of d + 1 stages, where d = \ ■ We will show that for every group of d + 1 stages, 
there exists a set of at least n/8 colours that split during this group, i.e colours that need to be packed in more 
bins than the ones available in the beginning of the group. 

Here is the proof of this claim. Assume that less than n/8 colours split during the first d stages of the 
group starting at round s{i) + 1. Then there is a set of at least 7n/8 colours that do not split during the next d 
stages. All items of these colours remain packed in the first as{i) bins. In this case, we have items of weight 

^(s(i + d)-s(i)) + s(t) = ^(7s(i + d) + s(i)) 

going into at most as{i) bins (the last inequality uses ([T])). Now we choose d so that > as{i), which 

is satisfied by taking d = [+ 1. 

This shows that at least 7n/8 (> n/8) colours (say C) must split during the following (i.e. (d + l)th) 
stage. The argument for this is the following: consider all the bins that contained items of colours C at the 
start of the group. All these bins become overfull just by considering the weight of items with colours in C. 
So for every colour c in C", at least one of its bins splits, and so at least \C'\ colours split. 

Clearly, an item of every colour is contained in some bin at the start of every group, so the claim implies 
that after q{d + 1) stages, we have at least qn/S splittings. So taking q > 8x/3, after > 8x/3(d + 1) 

stages, we have had > x/3n splittings, so some colour must have split > xjd times. It remains to choose n 
large enough so that we have at least > s(q|^) rounds. □ 

5.2 An online (3,1.7)-approximation 

In this section we provide an online algorithm that computes (3,1.7)-approximate packings, but we need to 
assume that each item has size at least e > 0, where e is a constant. Note that in the approximation factors 
there is a dependency on the value of e, since the bin stretch is limited by 30PT(/) + 0(log 1/e) and the 
colour stretch is limited by 1.70PT(/) + 0(log 1/e). 

Wlog assume that e = i for some positive integer j. We consider two types of bins: isolated, that 
corresponds to bins packing only items of a given colour, and non-isolated, that may pack items of different 
colours. For each i = 1,... ,j we define some special bins which we call level-t bins. A level-t bin is 
divided in exactly l/2*e regions each of size 2*e for z = 1,..., j. These regions are monochromatic (each 
region contains items of at most one colour). A region in some level-z bin is called a level-i region. We use 
a modified NF algorithm MNF to pack items into non-isolated bins, and switch to BBF to pack colours in 
isolated bins. MNF is similar to NF and a description is given below (Algorithm O. 

Notice that each colour occupies at most one level-z bin, for each level i = 1,... ,j. We say that a colour 
c has level i if z is the largest level of a bin containing items of colour c. The following algorithm uses MNF 
to pack items of the same colour until the colour has level j. When this happens the algorithm starts packing 
items of the colour in isolated bins (the last level-y bin is also considered an isolated bin) using the BBF 
algorithm. A description of the algorithm is given below (Algorithm |3]). 

A region is used when there are items packed on it. A bin is used when all its regions are used. The 
following lemma states that level-z bins that are used, have at least 1/3 of their capacity used by items. Notice 
that for each level z, at most one level-z bin is not using all its regions, since a new level-z bin is created only 
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Algorithm 2 Modified Next Fit (MNF) 

I: To pack item e of colour c and size s(e) 

2: Let i be the highest level where an item of colour c is packed 
3: (Let i = 1 if this is the first item of colour c) 

4: if e can be packed in the level-i region then 
5: Pack e into this level-i region 

6: else 

7: Let I > ihe the lowest level such that 2‘e ^ s(e) 

8: Pack e into a new level-Z region (possibly creating a new level-Z bin) 

9: end if 


Algorithm 3 A (3,1.7)-approximation algorithm. 

I: To pack item e of colour c and size s(e) 

2: if colour c has level < j then 

3: Pack e with MNF in the non-isolated bins 

4: else 

5: Pack e with BBF in the isolated bins. 

6: end if 


when all existing level-z regions are used. So there are at most 0(log -) = 0(1) non-isolated bins that have 
some unused regions. 

Lemma 6 Consider the non-isolated bins that have all their regions in use. On average, each bin has at least 
1/3 of its capacity used by items. 

Proof. We will prove this by considering the levels used by any colour c using non-isolated bins. 

For each colour c, a group is a maximal sequence of regions 2^e, 2^+^e,..., 2^+^’e used by colour c (each 
colour may occupy a number of disjoint groups). We will show that for each group, its regions used by colour 
c have 1/3 of their area occupied. Let 2^e, 2^+ie,..., 2'=+% be a group used by colour c. 

We have two cases: 

• p is odd: Consider the pairs of adjacent regions 

(2^=6,2^+ie),..., {2^+P-^e, 2^+%). 

Since we used MNF to pack the items, for each pair of regions the total weight of items is at least the 
size of the region in the lowest level. Since the higher level region is twice the size of the lower one, 
each pair has at least 1 /3 of its area occupied. 

• p is even: If k ^ 2 then there is an item in the first region 2^e of the group that could not fit in a 

previous used region by colour c. This item was packed in the smallest region with room for it. So this 
item occupies at least 1/2 of region 2^e. \f k = 1 then the assumption that s{e) ^ e implies that this 
region is filled by af leasf 1/2. The remaining regions 2^+^e,..., 2^~^Pe can be paired as in 

the odd case, and for each pair at least 1/3 of its total area is occupied. 


□ 


With this result we can prove the following theorem: 


Theorem 7 The algorithm is a (3, 1.7)-approximation scheme, and uses space at most 0{m). 
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Proof. Since we use BBF to pack isolated bins we can guarantee that on average at least 1/2 of the area of 
the isolated bins is occupied, and for the non-isolated bins, the previous lemma says that at least 1/3 of the 
capacity of these bins is occupied, with the exception of at most 0(log 1/e) bins. So for bin stretch we have a 
bound of 30PT(/) + 0(log 1/e). 

Now we consider the colour stretch. For a colour c using only non-isolated bins, it must use at most 
0(log 1/e) bins, which is a constant. If a colour c also uses isolated bins, then by the performance bound of 
BBF im, it uses at most 1.70PT(/c) -I- 0(log 1/e) bins. The approximation ratio (3,1.7) is then valid if 1/e 
is bounded by a constant. 

For the space bound, BBF uses at most 0(1) open bins per isolated colour, and MNF uses at most one 
open bin per level which is 0 (log 1 /e), a constante. □ 

We may also consider trying to improve the bin stretch bound of 3 by using a variation of FF instead of 
NF in the modified next fit scheme. A ‘modified firsf fif’ MFF works as MNF excepf fhaf sfep 2 is replaced by 
‘lef i be fhe lowesf level occupied by colour c wifh space for e’, so fhaf we try to pack e in each of the used 
regions, packing it in the first such region with space for e. This requires at most 0(log 1/e) open bins per 
colour, but one may expect better performance, bearing in mind that FF beats NF. The following result shows 
that this is not the case, and using MFF provides no improvement in the bin stretch. 

Theorem 8 Using either MFF or MNF, the approximation factor (3,1.7) is tight. 

Proof. Consider that e = i for some positive even integer j. Notice that the regions size are (2e, 4e,... , 1/2,1) 
which is equal to 2 ^’ • • • j ^) !)■ We will consider pairs of colours (c, c'). Assume that for colour c 

we receive j items in the following order: an item of size ^ 3 ^ followed by an item of size 23 ^ + for 
f = 0, 2, 6 ,..., j — 2, where 7 > 0 is arbitrarily small. Then we receive items of colour d. For colour d 
we have j — 2 items in the following order: an item of size ^ 3 ^ + 7 followed by an item of size for 
f = 1,3, ...,j-3. 

Using the MFF (or MNF) algorithm to pack these items, for colour c we will have items i, ^+7, 

7 packed respectively in regions ..., i, 1. For colour d we will have items 234 t+7) 

7 , ^ packed respectively in regions 23^5 2 ?^’ ■ ■ ■ j Pairing items of each colour we can see that they 

use approximately 1/3 of the allocated area for each pair (for small 7 ). So for each colour it uses approximately 
1/3 of the total area allocated to it. An optimal packing of the items of the colours (c, d) uses one bin almost 
full. To see this, note that the sum of the sizes of the items of these eolours is 

J 1 1 °° 

E37 + E(3T+7) = E3T + (i-i)7<E(i/2r = i 

i=2 i=2 i=l i=\ 

for sufficiently small 7 (i.e. for appropriate values of j and letting 7 ^ 0 , the sum of item sizes 

of each pair of colours can be made arbitrarily close to 1. We can then consider arbitrarily large instances by 
using many pairs of colours, thus establishing an asymptotic lower bound on 3 on bin stretch. 

In the same instance we consider a special colour c* where we first receive an item of size 1 and then an 
instance that provides the worst case ratio 1.7 for the BBF algorithm (see US). Then for colour stretch the 
bound 1.7 is also tight. □ 

5.3 An online (2 + e, 1.7)-approximation 

In this section we show how to extend the algorithm of the previous section [5)2] to get an online algorithm that 
computes (2 + e, 1.7)-approximate packings. We assume that each item has size at least e > 0. 
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The algorithm also uses isolated and non-isolated bins. It pack items of colour c in non-isolated bins while 
the total size of packed items of colour c, w{c) ^ g, where g = 1/e. When w{c) > g the algorithm uses 
isolated bins to pack items of colour c. The algorithm is given below (Algorithm lUl. 


Algorithm 4 (2 + e, 1.7)-APPROXiMATiON 
1: Let g = 1/e 

2: For each colour c let w{c) = 0 

3: To pack item e of colour c and size s(e) 

4: if w{c) ^ g then 

5: Pack e into non-isolated bins using FF 

6: w(c) ■(— w{c) -P s(e) 

7: else 

8: Pack e into isolated bins of colour c using FF 

9: end if 


Theorem 9 The algorithm is a (2 + e, 1.7)-approximation. 

Proof. First consider colour stretch. For each colour c, it uses at most g/e non-isolated bins, because each 
item has size at least s. When w{c) > g it packs all items of this colour in isolated bins, and since we use the 
FF algorithm we can bound colour stretch by 1.70PT + 0{g/e). 

Now for bin stretch we have the following. At the end of the execution of the algorithm, it uses A^i non¬ 
isolated bins. It uses some isolated bins as well for large colours (the ones with w{c) > g). There are some 
large colours that uses just one isolated bin, and assume there are k of these large colours. There are some 
other large colours that uses more than one isolated bin, and assume that in total the algorithm uses N 2 bins 
for these large colours. 

Since we used FF to pack the items in non-isolated bins we know that on average each one of the A^i bins 
are full to 1/2. For the same reason the bins used by large colours that uses more than one bin are full on 
average by at least 1/2. So the following bound is valid OPT ^ s(e) ^ (A^i + N2)/2. 

Notice that the algorithm starts to use isolated bins for colour c, only when w{c) > g, and then we also 
have the bound OPT ^ s(e) ^ k ■ g. 

follows 


k 


The approximation ratio of the algorithm is then bounded as 


ratio = 






Ni + N2 + k 
OPT 

Ni + N2 + k 
EeS(e) 

NI + N2 
(iVi + iV2)/2 
2 + 1/g. 


Since g = 1/s the algorithm is a (2 + e, 1.7)-approximation. 


□ 


6 An APTAS to approximate optimal bin stretch 

Our lower bounds show that it is impossible in general to achieve a (1 + e, 1 + e)-approximation. We now 
show that for any colour stretch /3 > 1 and every class of instances that admit a (1, /3)-approximate packing. 
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we can compute a (1 + e, (1 + e)/3)-approximate packing in polynomial time. In this section we consider 
that the number of different colours m is bounded by a constant; relaxing this restriction remains open. It 
is also worth noting that our problem now is slightly different since we are assuming instances that admit a 
(1, /3)-approximate packing, and in general there are instances that do not admit such packings (see Theorem 
□). 

We shall use OPT^(/) to denote the smallest number of bins needed to pack items I using colour stretch 
^ /?. Similarly (by slight abuse of notation), OPT^ c(^) is the number of bins spanned by colour c in such a 
packing. Clearly, OPT^^c(.^) ^ /30PT(/c). The scheme we describe below computes a packing P satisfying 
(1) total number of bins P{I) ^ (1 + e)OPT^(/) + 0(1), and (2) for each colour c, it uses at most Pc{I) ^ 
(1 + e)OPT^,c(/) + 0(1) ^ /3(1 + e)OPT(/e) + 0(1) bins. 

The idea is to use a variant of the grouping and rounding technique, but to explicitly work with the instance 
where items are rounded down. We are able to show that by packing some very large items and very small 
items separately, only a few items ‘overflow’ from some optimal packing OPT, and thus we can still achieve 
the desired colour stretch and bin stretch. 

Denote by P the items in I with size at least (large items), and P the remaining items in I (small items). 

Packing large items: Partition the large items by colour: P = Ii, ... ,Im and let ric = \Ic\ be the number 
of items of each colour c. Then sort each colour Ic by decreasing order of item size and partition it into at most 
M= [1/e^] groups/cl, Ic 25 ■■■ j .^cM^ be Ic = ^ci II ••• ll^cM where II is a concatenation operator. Each group 
has [nce^J items except perhaps the last. 

For each group of each colour, round down the items to the size of the smallest item in the group (by 
contrast with VL, who round up item sizes). As before, we can enumerate all such packings: the number 
of items per bin is at most y ^ 1/e^, and the number of distinct item sizes is a constant mM (recall m is 
the number of colours and is assumed to be a constant). Thus, there are at most r' = different bin 

configurations. We shall do something more involved with the small items, so we shall attach to each bin 
configuration a subset of colours that shall be used for the small items later on. This gives at most r = r'2"* 
total configurations so the number of feasible packings into at most n bins is bounded by ^ (n + rp. 

Notice that among the configurations there are some that may contain bins with no large items, and just have 
a subset of colours attached to show that small items can be packed later. 

We enumerate all such packings, and keep only those that have colour stretch at most /3 (ignoring the 
additive constant). One of these packings corresponds exactly to an optimal packing after removing its small 
items and with its large items rounded down. 

A similar domination argument to before will now show that at least one of these packings has close to the 
desired colour stretch and bin stretch. Let P be one of the enumerated packings with colour stretch at most /3. 
Since item sizes were rounded down, each group Icj in P gives a packing for the items with orignal sizes in 
the next group Ic{j+i) (all these items have smaller size than the previous group). The only items not packed 
by this are those in the first group (with largest size) - denote these items by Q = U)!h]^/ci. 

The ‘very large’ items in Q are packed into new bins using first fit (FF), considering all items of one 
colour before the next colour. Let P{Q) be the size of the packing obtained in this way. The following simple 
argument shows that these very large items will contribute only a small amount to the total bin and colour 
stretch. 

Lemma 10 P{Q) ^ eOPT(/) and P{Qc) ^ eOPT{Ic) for each colour c. 

Proof. Clearly first fit packs at least one item per bin. Since IQj ^ each item has size at least 

we have P{Q) ^ eOPT(/). Since we apply first fit to items grouped by colour, the same argument establishes 
the claim for colour stretch. □ 
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Packing small items. Let P = {Bi,..., B^} be a packing of the large items iK We now wish to pack 
carefully the small items P into P. 

The packing of the small items is obtained from a solution of a linear program. Recall that when enumer¬ 
ating packings of large items, each bin was tagged with a subset of colours that could be used to pack small 
items. Let Ni C {1, ..., m} be the set of possible colours that may be used to pack the small items in the bin 
Bi of the packing P. For each colour c ^ Ni, define a non-negative variable x®. The variable x® indicates the 
total size of small items of colour c to be packed in the bin B^. Denote by the total size of items already 
packed in the bin B^. Consider the program denoted by LPS: 

k 


max^^ X® 

s.t. 


i=l c£Ni 



s{Bi) 

^1 z = l,.. 

.,k (1) 

ceNi 

k 



2 = 1 

^ s(/^) c=l,.. 

., m (2) 


where is the set of small items of colour c in I. The constraint (1) guarantees that the total size of items 
packed in each bin does not exceed the bins size and constraint (2) guarantees that the sum of the values of 
variables x® is not greater than the total size of small items. 

Given a packing P, and a fractional packing of the small items, we do the following: for each variable x® 
we pack, while possible, the small items of colour c into the bin Bi, so that the total size of the packed small 
items is at most x®. The possible remaining small items (the ‘overflowing’ items) are packed using FF into 
new bins, again grouped by colour (meaning pack all items of one colour before the next). 

Algorithm 5 APTAS (/) 

Require: Number of different colours m in / is 0(1) 

1 : Fix e > 0 

2: Split items I into small (< e^) P and large (^ e^) P 
3: Group large items by colour and sort by decreasing size 

4: Group large items of each colour into [1/e^] groups and round item sizes down in each group 
5: Enumerate all packings of large items, with attached ‘small colours’ labels 
6 : For each colour c, pack remaining ‘very large’ items Qc using FF 
7: For each packing P, solve LPS to add small items 

8 : For each packing P, pack the overflowing small items into new bins using FF 
9: Return the best packing that has colour stretch at most /3 


Approximation ratio. We will claim that there exists a packing P such that after the very large items Q and 
the small items P have been packed into P, it has the desired bin and colour stretch. In particular, at least one 
packing uses at most (1 -|- 0(e))0PT^(/) -|- 0(1) bins in total and at most (1 -|- 0(e))0PT^^c(^) + 0(1) ^ 
f3{l + O(e))OPT(Ic) -|- 0(1) bins for each colour c. 

Theorem 11 Let fl > 1 be the desired colour stretch. The algorithm finds a packing P such that P{I) ^ 
(1 -|- 0(e))0PT^(/) -|- 0(1), and Pc{I) ^ /3(1 + 0(e))0PT(/c) + 0{l)for each colour c. 

Proof. Let OPT^ be an optimal packing for the instance I with colour stretch /3. Let OPT|^ be the packing 
OPT p without the small items and with the large items rounded down as described. Assume that each bin of 
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OPT|^ has an indication of the colours of small items used in the corresponding bin of OPT^. Clearly in the 
enumeration step of the algorithm one packing with the same configuration of OPT|^ with rounded items, is 
generated. This gives a packing P for the original items in P \ Q. Notice that the number of bins used by P 
and OPT p is the same. The very large items in Q are packed separately. 

In the packing P there must be enough room to pack all small items, since there is in OPT p. When packing 
the small items (guided by the fractional packing LPS), at most one small item of each colour is not packed 
into each desired bin. So, the total size of small items that overflow and need fo be packed info new bins 


IS 


af mosf OPT^(/)e^m. These small ifems use af mosf 


OPT^(/)e^m 


+ 1 new bins, since each bin is full 


fo af leasf (1 — e^) excepf perhaps by fhe lasf one. Considering colour sfrefch, each colour c uses af mosf 

OPTnjPP 


(i-e") 


+ 1 new bins. 

The algorifhm packs fhese small items in new bins obfaining a new packing P'{I \ Q). The number of 
bins is af mosf 


P'{I\Q) ^ OPT^(/) + 


1 ^ 

(1 + 0(£))0PT^(/) + (7(1). 


+ 1 


Considering colour sfrefch we have, for each colour c. 


P'{Ic\Qc) ^ OPT^(I,) + 


OPT^(/e 


l-e2 

(l + 0(e))0PT;3(/c) + 0(l). 


+ 1 


( 2 ) 

(3) 

(4) 

(5) 


To finish fhe proof, if remains fo consider fhe very large items Q. For fhese. Lemma [TO] shows fhaf fhey 
need an exfra e fracfion of bins for each colour and in fofal. Nofice fhaf in order fo obfain a fruly (1 + e) 
approximated solution, we need fo rescale fhe value of e, for example by using e' = e/m due fo fhe faclor 
m multiplying e in fhe term OPTp{I)£‘^m on equation (2). So fhe running fime of fhe enfire algorifhm 


dominated by fhe enumeration step which is ) = 0(rp^^'^IPP' ) 


IS 


□ 


7 Open Problems 

• Improved approximation ratio. Can we get an online algorithm with (1.7 + e,1.7 + e) approximation 
ratio, with the assumption on minimum item sizes? 

• Multicoloured items. The multicoloured case is also interesting: fix a set of (possibly unbounded) 
colours C, and let each item have several (say at most k) colours from C. The original definitions of 
colour and bin stretch still apply. By allowing k copies of each item to be packed, it is certainly possible 
to reuse any {a, /3)-approximation algorithm in this paper to construct one with bin stretch ka and colour 
stretch f3. Is it possible to do better? 

• Network packing version. Let the items form a graph as follows: the vertices are the items, and the 

(weighted or unweighted) distance between two items is a measure of how closely together the items 
should be ‘packed’. The notion of bin stretch is as before, and colour stretch is replaced by the following 
notion of ‘strong diameter stretch’: for a set of vertices X, let diam(2f) be their ‘strong diameter’, i.e. 
maxu^vex do{u,v). Let B{X) be the bins spanned by items in X. Then strong diameter stretch is 
maxxcy What bounds can we achieve when using this quantity, and does it depend on eg. the 

expansion of G? 
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